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ABSTRACT 

We present a study of the average properties of luminous infrared galaxies detected directly at 24 ^m 
in the COSMOS field using a median stacking analysis at 70 /^m and 160 /im. Over 35000 sources 
spanning 0<z<3 and 0.06 mJy<524<3.0 mJy are stacked, divided into bins of both photometric 
redshift and 24 /^m flux. We flnd no correlation of S'7o/S'24 flux density ratio with 524, but flnd that 
galaxies with higher 5*24 have a lower 5i6o/'S'24 flux density ratio. These observed ratios suggest that 
24 /xm selected galaxies have warmer spectral energy distributions (SEDs) at higher mid-IR fluxes, and 
therefore have a possible higher fraction of active galactic nuclei. Comparisons of the average S70/S24 
and >S'i6o/>S'24 colors with various empirical templates and theoretical models show that the galaxies 
detected at 24 /im are consistent with "normal" star-forming galaxies and warm mid-IR galaxies such 
as Mrk 231, but inconsistent with heavily obscured galaxies such as Arp 220. We perform a x^ analysis 
to determine best fit galactic model SEDs and total IR luminosities for each of our bins. We compare 
our results to previous methods of estimating Lir and find that previous methods show considerable 
agreement over the full redshift range, except for the brightest ^24 sources, where previous methods 
overpredict the bolometric IR luminosity at high redshift, most likely due to their warmer dust SED. 
We present a table that can be used as a more accurate and robust method for estimating bolometric 
infrared luminosity from 24 /im flux densities. 
Subject headings: galaxies: evolution - galaxies: high-redshift - galaxies: statistics - infrared: galaxies 



1. INTRODUCTION 

Although rare in the present-day universe, lumi- 
nous infrared galaxies (LIRGs) were much more nu- 
merous in the past, and they may have played a sig- 
nificant role in the evolution of a large fract ion of 
L > L* galaxies ([S anders & M irabell [l99 6: Blai n et al.l 
I200I iLaeache et al.l 120041: ILe Floc'h et al.ll2005D . How- 
ever, their exact contribution is still poorly understood 
due to two limitations that have plagued deep sur- 
veys performed so far: (1) the difficulty to identify the 
most obscured and distant of these objects, as well as 
measure their redshifts (they are often f aint at opti- 
cal w avelengths because of dust extinction, iHouck et al.l 
120051 ). and (2) the difficulty to accurately character- 
ize their nature (bolometric luminosity, mass, physi- 
cal processes powering their energy output). Further- 
more, because of limited sensitivity of current space- 
{Spitzer MIPS) and ground-based (SCUBA, BOLO- 
CAM, MAMBO, AzTEC,...) observations in the far- 
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IR/submillimeter, only a small number of the most lumi- 
nous of these sources has been studied in detail. In ad- 
dition, at high redshifts there are significant limitations 
due to confusion, which results from the very large instru- 
ment beam characterizing current far-IR/submillimeter 
observations. 

Many previous studies of LIRGs have been based on 
data obtained with the Spitzer Space Telescope, in par- 
ticular with the Multiband Imaging Photometer (MIPS, 
Rieke et.al. 2004) at 24 /im, the detector's most sensitive 
band. Using extrapolations based on libraries of galac- 
tic infrared (IR) spectral energy distributions (SEDs), 
the observed 24 /im flux is converted to a bolometric 
IR luminosity, Lir = L(8-1000/im), which is then used 
to calculate properties such as instantaneous star forma- 
tion rate (SFR). However, at higher redshifts the 24 /im 
band probes shorter rest frame wavelengths, probing rest 
frame 12 /im at z ^ 1, and rest frame 8 /im at z ~ 2. 
The typical peak of the IR SED of star-forming galax- 
ies and galaxies containing active galactic nuclei (AGN) 
falls around 50-200 /im; at higher redshifts, the 24 /im 
band probes wavelengths farther away from the peak of 
the IR SED and begins to be heavily affected by broad 
mid-infrared PAH emission and silicate absorption fea- 
tures. 

Observations at longer wavelengths, such as in the 
Spitzer MIPS 70 /im and 160 /im bands (which probe 
rest frame 24 /im and 54 /im at z ~ 2, respectively), are 
needed to more accurately characterize the bolometric 
luminosity, especially at higher redshifts. However, the 
MIPS 70 /im and 160 /im bands are significantly less sen- 
sitive and have worse angular resolution than the 24 /im 
band. This leads to a drastic decrease in the number of 
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sources directly detected at 70 /zni and 160 /im, and the 
galaxies that are detected are biased toward the most 
luminou s sources. Therefor e, we use a stacki ng analy- 
sis (as in lDole et al . 2006; P apovich et alJ[2007l ) to study 
the average 70 /im and 160 /im flux densities of galaxies 
detected at 24 /im. In using a stacking analysis we lose 
the ability to study individual galaxies, but find aver- 
age properties of galaxies that would otherwise be unde- 
tectable. 

In this work we explore the average mid- to far-IR 
flux densities of galaxies detected at 24 /im and derive a 
more accurate method to estimate bolometric IR lumi- 
nosity. To accomplish this, we measure stacked 70 /xm 
and 160 /zm flux densities of galaxies detected at 24 /urn 
in the Cosmic Evolution Survey (COSMOS) field, binned 
in both redshift and 24 /im flux. We use these stacked 
fluxes to examine the evolution of mid- to far-IR colors 
of galaxies as a function of luminosity and redshift. Our 
stacked fluxes are fit to libraries of galactic IR SED tem- 
plates, from which we der ive an estimate of the average 
bolometric IR luminosity. iPapovich et al.l ()2007fl carried 
out a similar study employing stacking at 70 fiia and 
160 /im, but their analysis was limited by area, with a 
significantly smaller number of sources. With an area 
almost 10 times larger, we obtain more reliable statistics 
and the ability to bin our sources in narrower bins of 
redshift and fiux. Our stacked fiuxes will eventually be 
merged with Herschel PACS (100 & 160 /im), Herschel 
SPIRE (200-500 /im), and SCUBA2 data to get the best 
sampled SEDs of the high- 2; literature. 

Throughout this work we denote flux density, fi, in 
MIPS 24 /im, 70 /im, and 160 /im bands as 5*24, S'70, and 
"5*1601 respectively. When calculating rest-frame quanti- 
ties, we use a cosmology with il^ — 0.3, A — 0.7, and 
Ho = 70kms-i Mpc'^. 

2. DATA 

We use data f rom the Cosmic E volution Survey 
(COSMOS) field (IScoville et al.l \200T\) . a ~ 2 deg^ 
field centered at right ascension 10''00™28''.6, declina- 
tion 02°12'21'.'0 (J2000) with extensive muhiwavelength 
imaging and spectroscopic coverage. In this study, 
we make use of th e COSMOS Spitzer (S-COSMOS; 
I Sanders et al.l [20071) observations, specifically the data 
taken by MIPS in the 24 /zm, 70 /im, and 160 /im bands. 



2.1. 



fim Catalog 



The 24 / im data reduction and source extraction are 
detailed in iLe Floc'h et al.l ()2009[ ). The 24 /im sources 
were detected using the automatic procedu re of the 
Sextractor software ([Bertin &: ArnoutsI 119960 . and the 
flux densities were measured with multiple iterations of 
the point-spread func tion (PSF) fit ting technique of the 
DAOPHOT package (|Stetsonlll98"7[ ). Following the con- 
vention adopted by the Spitzer Science Center, a stellar 
10,000 K blackbody spectrum was assumed as the refer- 
ence SED for the 24 /im flux density measurements. The 
final source list is complete to more than 90% above a 
24 /im flux of ^24 ^ 80/iJy, and according to simulations, 
is still reliable down to fluxes as faint as 60 /iJy, despite a 
lower completeness of 75%. The source list we use for our 
stacking analysis includes all sources with 6*24 > 60/iJy. 

2.2. 70 fim and 160 fj,m Mosaics 



The MIPS 70 / im and 160 / im da ta were reduced 
and processed by iFraver et al.l (|2009[ ). In short, the 
data were reduced using the Germanium Reprocessing 
Tools (GeRT, version 20060415) and additional special- 
ized scripts developed for processing survey data from the 
MIPS- Germanium 70 /im and 160 /im detectors. The fi- 
nal 70 /im and 160 /im mosaics have an image pixel scale 
of 4" and 8" and point-source noise (la) of 1.7 mJy and 
13 mJy, respectively, although there are local background 
fluctuations across the image depending on the local den- 
sity of sources. Frayer et al. (2009) find 1512 sources at 
70 /im, and 499 sources at 160 /im (> 5. Oct), but these 
detections represent the most luminous sources. In our 
stacking analysis, we do not treat these sources differ- 
ently than 24 /im sources that were not detected at 70 
/im and 160 /im (see Section [3]). 

2.3. Photometric Redshifts 

The extensive multiwavelength coverage of the COS- 
MOS field leads to photometric redshifts (hereafter 
photo-z) with an accuracy better than e ver achieved in 
any other field, as detailed in lllbert et al.l p009). Optical 
counterparts of the 24 /im sources were found from corre- 
lating the data w i th the i^Tg— band COSMOS catalog of 
iMcCracken et al.l (|20100 . as detailed in iLe Floc'h etal.1 
(j2009() . Photo- 2; were then calculated using fluxes in 
30 bands, covering the far-UV at 1550 A to the mid- 
IR at 8.0 /im. The uncertainties in the photo-z depend 
primarily on the redshift and apparent i^ magnitude of 
the source, with errors increasing with fainter and more 
distant galaxies, but a comparison with faint spectro- 
scopic samples in the COSMOS field revealed a disper- 
sion as low as cta2/(i+z3) = 0.06 for sources with 23 mag 
< i 
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< 25 mag at 1.5 < z < 3 (jLillv et al.ll2"00l . 



Approximately 1000 of our 24 /im sources arc also de- 
tected in the X-ray by XMM-Newton (Brusa ct al. 2010), 
and for these source s we use the photo-z's derived from 
ISalvato et al.l (|2009H , who have the best photometric red- 
shifts ever produced for AGNs. In all, we have reliable 
photo-z for 35,797 sources detected at 24 /im (~ 92% of 
the 38679 total 24 /im sources). We do not use the re- 
maining sources without photo-z's in our study, but these 
sources are generally at the low 6*24 end of our sample, 
where we have enough sources to perform a meaningful 
analysis. 

3. ANALYSIS 

To study the average 70 /im and 160 /im properties 
of the LIRGs in the COSMOS field, we employ a me- 
dian stacking analysis to overcome the poor sensitivity 
of the 70 /im and 160 /im MIPS detectors. Stacking of 
IR emission has proven valuable for studies such as av- 
erage 24 /im fluxes in faint galaxies (Zheng et al. 200^ 
and contribution s to the far IR extragalactic background 
()Dve et al.l I2007D . A stacking analysis of 70 /im and 
160 /im flu xes of galaxies s e lected at 24 /im has been per- 
formed bv IPapovich et ahl ()2007t ). However, their analy- 
sis used data taken in the Extended Chandra Deep Field 
(ECDF-S), which covers 775 arcmin^, and includes only 
395 sources. As a result, they do not have significant 
detections in some of their lower 24 /im flux bins. With 
the COSMOS data, we have almost 100 times as many 
sources (stacking efficiency goes as ~ N^), and will be 
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able to make a much m.ore detailed analysis with nar- 
rower bins in both redshift and rest-frame mid-IR lumi- 
nosity. 

We divide our 24 /im source list into bins of both red- 
shift and 5*24 before stacking. Redshift bins ensure that 
the fluxes we stack were emitted at the same rest-frame 
wavelength so that we probe the same parts of the SED, 
and iS'24 bins separate galaxies with different IR luminosi- 
ties. Our redshift and 6*24 bins were chosen to maximize 
the number of sources in each bin while providing the 
best coverage in redshift and flux; Table [1] lists the bin 
limits and the corresponding number of sources in each 
bin. 

3.1. Stacking Methodology 

We begin by taking a 40 x 40 pixel (2.7' x 2.7' at 70 fim 
and 5.3' x 5.3' at 160 /im) cutout centered around each 
24 fim source in a given bin. For reference, the FWHM 
of the 70 iim mosaic is 18.6" and the FWHM of the 
160 /im mosaic is 39". The size of the cutout does not 
affect the measured average (stacked) flux as long as the 
cutout encompasses a large enough area to make a lo- 
cal background estimate. We center each 70 /im and 
160 /im subimage on the astrometric coordinates of the 
24 /im source using a bilinear cubic interpolation, and 
then subtract a local background from each subimage; 
the local background is calculated from pixels exterior to 
~ 1' and ^ 2' at 70 /im and 160 /im, respectively. Before 
creating our stacked image, we rotate each subimage by 
90° with respect to the previous subimage to reduce the 
effects of image artifacts in our analysis. 

We then "stack" these subimages, aligned at the cen- 
ter (on-source) position and calculate a median flux den- 
sity at each pixel position. A median stacking analy- 
sis is preferable to mean stacking because the median 
analysis is more stable and robust to small numbers of 
bright sources. The main problem with a mean stack- 
ing analysis is that it is very sensitive to bright outliers, 
which contaminate on-source flux measurements and in- 
troduce considerable noise from nearby, bright neighbors. 
Most mean stacking studies avoid this problem by re- 
moving bright sources from all images before stacking, 
but this technique solves one problem and creates two 
more. Removing bright sources introduces a slight bias 
against more luminous sources and the resultant stacked 
flux varies based on the exact flux density cutoff chosen; 
someone who chooses to remove all sources > 4cr will 
measure a different flux than someone who chooses to 
remove all sources > 3a. The median stacking analysis 
avoids these problems, but is more difficult to interpret. 
From detailed comparisons of median and mean stacking, 
[White et al. (2007) find that in "a limit where almost 
all the values in our sample are small compared with 
the noise ... it is straightforward to interpret our median 
stack measurements as representative of the mean for the 
population of sources." Our sample fits this description, 
with only ^ 4% of our sources detected at 70 /im, and 
less than 2% of our sources detected at 160 /im, so we 
take the results of our median analysis as representative 
of the mean flux density. 

We calculate the flux of our final stacked image using 
the DAOPHOT-type photometry IDL procedure, APER, 
with photometry aperture of 35" and sky annulus radii 
of 39"-65" at 70 /im (at 160 //m, we use an aperture of 



48" and sky radn of 64"-128"). We use these radn in 
conjunction with the published MIPS aperture correc- 
tions for a 10 K blackbody given by the Spitzer Science 
Center (1.48 at 70 /im and 1.642 at 160 /im) to estimate 
the total flux density from our stacked source. 

3.2. Uncertainties in Stacking 

We measure an error in our stacked flux densities from 
the variance in the local background of the stacked im- 
age and the uncertainty in the mean sky brightness. The 
absolute calibrati on of the MIPS dete c tor at long wave- 
lengths is ~10% ([Gordon et all 120071: iStansberrv et al.l 
[2OO7J), and this dominates the errors in our stacked im- 
ages in all but the noisiest of bins. Since a median anal- 
ysis is a ranking measurement, we find an error in the 
median by sorting each pixel and then measuring the 
difference between the middle (median) value and the 
value that is N^^"^ ranks away from the middle, where 
N^/^ is the Poissonian noise from a bin with N sources. 
For all bins, this represents an almost negligible source 
of error. 

To test for confusion from nearby bright sources, we 
searched the 24 /im catalog for nearby sources that would 
fall within the apertures used to measure the 70 /im 
and 160 /im flux densities. We find that only r^ 3% of 
our sources have a neighbor within the (larger) aperture 
used to measure the 160 /im fiux. However, the location 
of each of these nearby neighbors relative to the target 
source will not be uniform, which suggests that the con- 
tribution of nearby neighbors detected at 24 /im to our 
final stacked fiux should be negligible. The fraction of 
galaxies with neighbors within the 160 /im aperture is 
fairly constant in all bins. 

Confusion from faint sources can also add uncertainty 
to flux measurements; galaxy clustering suggests that the 
confusion from faint sources will generally be more sig- 
niflcant near detected sources than at off-source back- 
ground positions. The proper method to account for con- 
fusion from faint sources is still currently debated, but 
the uncertainty is expected to be important mostly for 
data at very long wavelengths, such as in the submillime- 
ter regime, which is generally confusion-limited (H. Dole 
2010, private communication). The COSMOS MIPS 
data used in our stacking analysis are not confusion- 
limited, so we expect a negligible contribution from con- 
fusion. 

4. RESULTS 

4.1. Average 70 iim and 160 fim Flux Densities 

We performed a median stacking analysis for all 56 
bins of redshift and 6*24 in our sample at both 70 /im 
and 160 /im. From a visual inspection of the images pro- 
duced by the stacking analysis, we flnd clean detections 
in 88% of our 70 /im stacks, and 73% of our 160 /im 
stacks. The rest of the stacks can be split into two cat- 
egories: (1) non-detections, which have no signal at all, 
and (2) bad detections, which have a visible, but dis- 
torted signal that does not resemble a clean PSF. The 
non-detections do not have enough signal to noise for an 
average source to emerge, but the bad detections do not 
always have this same problem. The non-detections and 
bad detections are mostly in low flux and high redshift 
bins, although there are a few bad detections in the low- 
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est flux bins with low/intermediate redshifts. The bins 
containing the non-detections and bad detections all have 
a fairly high number of sources, so we believe the lack of 
a clean detection is due simply to the faintness of the 
sources we are trying to stack. In the rest of our anal- 
ysis, we treat the non-detections as upper limits, but 
include the bad detections in our full analysis. Tables [5] 
and [3] list the measured 70 /xm and 160 /im fluxes and 
errors in each of our bins, with upper limits given for 
non-detections. Figure [1] displays an example of a clean 
detection and a bad detection at 70 /im. 

4.2. Evolution of 70/24 M"^ '^^d, 160/24 M"^ Color with 
24 iJ-m Flux 

The mid- to far-infrared flux density ratios (or colors) 
of our stacked galaxies give us insight into the prop- 
erties of the dust emission from these galaxies. Fig- 
ure [H shows the average stacked SJ0/S24 and S'i6o/S'24 
flux ratios of our sources, with each color representing 
a different redshift bin. From the top panel of Fig- 
ure [21 we see that the average 6*70 /S'24 colors fall in 
the range 3 < S'7o/S'24 < 20, which is rou ghly con- 
sistent with results reported in IPapovich et al. (2007), 
who found an average S'70/5'24 ~ 9. They also find that 
sources with S24 > 250/L(Jy have a lower average flux ra- 
tio {S70/S24: w 5), but we do not see a trend of decreasing 
Sro/ S24 flux ratio with increasing ^'24. Our results show 
a mostly flat Sio/S2i color with respect to 524. 

We also find a trend of decreasing 870/824 flux ratio 
with increasing redshift (from 870/824 « 15 at z '^ 0.3 
to 870/824 ~ 5 at 2; ~ 2). This does not necessarily 
mean that high redshift galaxies have lower 870/824 flux 
ratios because at higher redshifts, the observed 870/824 
measures flux ratios at shorter rest-frame wavelengths. 
Galaxies with strong mid-IR polycyclic aromatic hydro- 
carbon (PAH) features have mid-IR (^ 24/xm) emission 
that is flatter than their far-IR (^ 70/im) emission, which 
will lead to a lower observed 870/824 flux ratio at higher 
redshifts, even when observing galaxies with identical 
SEDs. We explore the dependence of 870/824 on red- 
shift further in Section 15.11 through comparisons of our 
stacking results with models. 

Although we do not see a strong trend in 870/824 ratio 
with 5'24, we see a clear trend of decreasing S'i6o/'S'24 
ratio with increasing 824 in the bottom panel of Figured 
The average S'160/5'24 ratios we measure range from 10 < 
'S'i6o/'5'24 ^ 100 and are much larger than the 870/824 
ratios because the 160 fiia band samples fluxes emitted 
at wavelengths closer to the peak of galactic IR SEDs. 
IPapovich et al.l (|2007[ ) did not have high enough signal- 
to-noise in their stacks at 160 fiia to explore S'i6o/'S'24 
ratios, but they find average 5*160 ranging from 3.8 to 
10.5 mjy, broadly consistent with our values of 5*160 in 
the lower 524 bins. 

Figure [5] shows that the brightest 24 /zm sources have 
low 5i6o/524 ratios. Given that ^24 broadly correlates 
with IR luminosity, this means that on average, galaxies 
with brighter IR luminosities have lower 8100/824 flux 
ratios, and therefore flatter spectra. This trend is true 
in all our redshift bins, although the effect is less pro- 
nounced in our highest redshift bins. 

The warmer 81Q0/824 colors in the higher ^24 bins sug- 
gest that these sources, on average, have a higher fraction 
of AGNs (,Sandcrs &: MirabcL1996; .Laurent et al..,200dl ). 



Dust grains in the dusty torus around AGN can be heated 
up to their sublimation temperature (1500 - 2000 K), 
while dust grains in the diffuse interstellar medium and 
star-forming regions of galaxies are stochastically heated 
to lower temperatures around 30 ~ 40 K, or up to 200- 
400 K in HII regions. The emission from the warmer dust 
grains around AGN will mostly dominate in the mid-IR 
wavelengths, while the emission from the colder grains 
in star-forming regions will dominate the far-IR. Thus, 
galaxies powered by AGN will be flatter in their mid-IR 
to far-IR colors. 

It should be noted that although we do flnd a 
trend of flatter 5i6o/524 ratios that is indicative of a 
higher fraction of AGN, our sources are still most likely 
dominated by star fo rmation at the '^85%-90% level 
(jLeFloc'hetal.1 120091 ). Out of our over 35000 sources, 
only ~1000 have X-ray counterparts, suggesting that 
AGN make up a negligible population of our bins, ex- 
cept for possibly the highest 524 bins. Figure [3] displays 
the fraction of sources in each bin that are also detected 
in the X-ray by XMM-Newton. At dim 524 bins, we 
see that XMM-Newton sources indeed account for a low 
percentage of our sources. At bright 524 bins, the X- 
ray-detected sources begin to account for an appreciable 
fraction of our sources, but this does not mean that the 
mid- and far-IR fluxes of these sources are dominated by 
AGN. We discuss this further in Section [5?2] 

5. DISCUSSION 

5.1. Comparison to Models 

In this section, we compare the results of our stack- 
ing analysis with the expected fluxes and colors from 
theoretical models and empirical templates. We flrst 
compare our stacked 870/824 and 5i6o/524 flux ratios 
wi th empirical m o dels o f "normal" star forming galaxies 
bv lDale fcHeloul ()2002l hereafter DALE) and models of 
Arp 220 and Mrk 23 1 from the SWIRE template library 
(jPoUetta et al.l[2007f ). We then constrain the average IR 
luminosity for each bin by fitting to many libraries of 
theoretical models and empirical templates. 

Figure m plots 870/824 and 84^0/824 color as a func- 
tion of redshift, along with the expected values from the 
DALE models of star forming galaxies and the SWIRE 
models of Arp 220 and Mrk 231. The DALE models are a 
one parameter family of models and we show models that 
cover a range of 1 < a < 2.5, which describe normal star- 
forming galaxies with 8.3 < log{Lm) < 14.3. Arp 220 is 
a well-studied galaxy representative of heavily obscured 
ULIRGs, while Mrk 231 is representative of galaxies with 
warm mid-IR colors which are known to host AGN. We 
use the code Le Phar^ developed by S. Arnouts and 
O. Ilbert to determine the flux ratios of the DALE and 
SWIRE models at varying redshifts. Le Phare is a data 
analysis package used primarily to compute photometric 
redshifts, but a preliminary phase of the code also com- 
putes theoretical magnitudes, given SED libraries and 
fllter bands. We can see immediately that the average 
colors determined from our stacking analysis fall within 
the region spanned by normal star-forming galaxies and 
Mrk 231, but our colors do not match those of Arp 220 
at any redshift. This suggests that our 24 /zm selection 
is biased against heavily obscured objects like Arp 220. 

^ http://www.cfht.hawaii.edu/ arnouts/lephare.htinl 
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5.2. Best-fit Model SEDs and Average Total IR 
Luminosity 

We use Le Phare to perform a x^ analysis to find 
best-fit galactic model SEDs for the stacked fluxes 
calculated in each bin. We u se the empiric a l tem - 
plates of iDalefc Heloiil (|2002D . iLagache et all ()2003[ ). 
iCharv fc Elbad (12001 ). and the theoretical radiation 



pressure models of 'Sie benmorgen fc Kriigel (|2007| ) , find- 
ing the SED from each library that best fit our stacked 
data at the correct redshift. The average S24, Sro, and 
5160 in each bin are plotted along with the best fit models 
in Figures [5al -5h. arranged by redshift (Figures 5b-5h are 
only available in the online version of the paper). Fluxes 
from the "non-detcted" bins are shown as upper limits. 
The parameter space spanned by all four of the models 
is shaded to give an idea of the spread of possible SEDs 
that fit the data. 

For our model fits, we use four different libraries of 
"normal" star-forming galaxies. We do have a small 
fraction of XMM-Newton-detected sources that contain 
AGN, especially in the highest flux bins, but this does 
not imply that the mid-IR and/or far-IR fluxes of these 
galaxies is dominated by the AGN itself (many X-ray 
sources are PAH dominated in the infrared). Because of 
the small number of sources, we are unable to perform 
a separate stacking analysis of only these X-ray sources. 
Although wc cannot account for the true contribution 
of AGN contamination, the tight fits we see suggest that 
most of our sources are indeed star formation dominated. 

From the maximum likelihood function of the x^ anal- 
ysis, we estimate a median Ljr and la uncertainties 
in each bin of our stacking analysis. We repeat this 
measurement for each library separately, and then take 
the mean of the four Ljr values to estimate the true 
luminosity. The dispersion of the luminosities derived 
from the different libraries is ^ 6%(3cr) in all our bins, 
which suggests that the four libraries are fairly consis- 
tent in their estimates of the best- fit Lir. We add the 
Icr uncertainties from each library in quadrature to esti- 
mate the error in Lir, and find typical 3a errors around 
3%. The average IR luminosities and errors measured 
for each of our bins are listed in Table H] Bins clas- 
sified as non-detections at both 70 /zm and 160 fim 
are marked as upper limits. As expected, we see that 
total IR luminosity increases with S'24 and with red- 
shift. These galaxies span a large range of IR luminos- 
ity, covering "normal" galaxies (Lir < IO^^Lq), LIRGs 
{10^^ Lq < LiR < lO^^L©), and ultra luminous infrared 
galaxies (ULIRGs; Ljr > 10^'^Lq). To test our uncer- 
tainties, we also found best fit SEDs using 70 /zm and 
160 /zm flux densities that were offset by the errors given 
in Tables [2] and [3] and then measured Ljr from these fits. 
We find discrepancies much less than 6% from this trial, 
suggesting that our derived infrared luminosities are ro- 
bust within the errors in our stacking analysis. 

Table 3] gives a relation between observed S'24 and to- 
tal IR luminosity, with no need for a /c-correction. This 
is an extremely valuable tool given the poor sensitivity 
of longer wavelength instruments, and provides an effec- 
tive way to estimate Ljr when only having 24 fiia data. 
Some caution must be taken when using Table |4] to esti- 
mate LiR,. Although the agreement between the best-fit 
models from many libraries is fairly robust, it is not a 



complete description of the errors. For most of these 
bins, we do not have data on the Rayleigh Jean side of 
our IR SEDs, which means we cannot estimate the cold 
dust component and its contribution to Lir. This means 
that Table m is valid under assumption that the libraries 
used are representative of the diversity of the SEDs be- 
yond 160/J,m/(1 -I- z). These results will be tested in the 
near future with Herschel and SCUBA2. 

Figure [6] shows a comparison of Lir calculated from 
our stacking analysis and Lir calculated from extrapo- 
lating mid-IR fluxes from only S'24 based on the Dale 
SED libraries, as is common practice. In general, the 
two methods are in good agreement at low redshifts, but 
the LiR calculated using only 24 /um data is an overes- 
timate of the true Ljr at high redshifts, especially for 
the brighter S24 sources. This is most likely due to the 
warmer dust SEDs that we find in the bright S24 sources 
fSectio n 14.21). Our re s ults a re consistent with the find- 
ings of iCalzetti et al.l (|2007D . who find that rest-frame 
24 ^m flux is a much better indicator of bolometric in- 
frared luminosity than 8 /.tm fiux, and with the findings of 
iPapovich et al.l (j2007f ). who find that the Ljr estimated 
without taking into account stacked 70 /im and 160 /jm 
fiuxes overestimates the true Lir. To summarize, ex- 
trapolating a bolometric infrared luminosity from a 24 
fiui flux density without taking into account 70 /xm and 
160 /im flux will result in an overestimate of Ljr at high 
redshifts. Table |4] will give a more accurate and robust 
estimate of Ljr at these redshifts. 

6. SUMMARY 

We perform a median stacking analysis on over 35000 
sources detected directly at 24 /im in the COSMOS field 
at 0<z<3 and 0.06 mJy<S24<3.0 mJy to study their 
average fiux densities at 70 /im and 160 /im. Of the 
56 bins used, 95% had detections at 70 /Ltm and 93% 
had detections at 160 /xm. Analysis of the S70/S24 and 
S160/S24 flux density ratios suggest the following. 

• 24 /im sources have average flux-density ratios con- 
sistent with empirical models of "normal" star- 
forming galaxies or with warm mid IR galaxies, like 
Mrk 231, which are known to host AGN. 

• 24 /im sources have average flux-density ratios that 
are inconsistent with Arp 220, which suggests that 
24 /xm is not very useful for finding heavily ob- 
scured objects like Arp 220. 

• Sources with brighter S24 have warmer S160/S24 
flux ratios, decreasing by a factor of 2 from 0.1 < 
S24/niJy < 1.0, which implies that galaxies with 
brighter infrared luminosities have a higher fraction 
of AGN. 

Our stacking analysis provides the largest statistical 
study of the average far-IR flux densities of the faint 
24 /im population. A comparison of the average far-IR 
fluxes to libraries of empirical templates and theoretical 
models allows us to estimate the total IR luminosity of a 
typical galaxy detected at 24 /im within certain redshift 
and S24 bins. We find that previous studies based on ex- 
trapolating LiR from 24 /im data, without far-IR stack- 
ing, generally overpredict the total infrared luminosity, 
especially at higher redshifts. A more accurate method 
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for estimating Ljr using only 24 nui flux and redshift is 
provided in Tabled which takes into account the average 
mid- and far-IR fluxes of 24 /im selected galaxies. 

It is a pleasure to acknowledge the contribution 
from all our colleagues of the COSMOS collaboration. 
More information on the COSMOS survey is available 
at http://www.astro.caltech.edu/cosmos. This work is 
based on observations made with the Spitzer Space 
Telescope, a facility operated by NASA/JPL. Finan- 
cial supports were provided by NASA through contracts 
1289085, 1310136, 1282612, and 1298231 issued by the 
Jet Propulsion Laboratory. We are grateful to Herve 
Dole for insightful discussions on stacking techniques. 
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TABLE 1 

Number of Sources in Each Redshift and 524 Bin 



S24, 








Redshift Range 








(mjy) 


0-0.4 


0.4-0.6 


0.6-0.8 


0.8-1.0 


1.0-1.2 


1.2-1.6 


1.6-2 


2-3 


0.06-0.08 


724 


578 


923 


1281 


1025 


1573 


1006 


1002 


0.08-0.10 


501 


423 


701 


887 


688 


1045 


749 


684 


0.10-0.15 


841 


681 


1011 


1479 


1092 


1395 


1280 


896 


0.15-0.20 


528 


421 


588 


834 


535 


557 


629 


482 


0.20-0.50 


1152 


745 


930 


1354 


628 


597 


790 


604 


0.50-1.00 


408 


153 


136 


187 


61 


75 


73 


78 


1.00-3.00 


228 


35 


28 


26 


11 


21 


20 


21 



TABLE 2 

Average 70 ^m Flux Densities [mJy] and Errors 



524 








Redshift Range 








(mJy) 


0-0.4 


0.4-0.6 


0.6-0.8 


0.8-1.0 


1.0-1.2 


1.2-1.6 


1.6-2 


2-3 


0.06-0.08 


1.65±0.18 


0.60±0.09 


0.64±0.09 


0.63±0.08 


0.73±0.10 


0.77±0.09 


<0.18±0.06 


<0.01±0.07 


0.08-0.10 


1.81±0.20 


1.82±0.20 


1.42±0.16 


0.94±0.12 


1.23±0.15 


1.17±0.13 


0.32±0.08 


<0.19±0.08 


0.10-0.15 


2.49±0.26 


2.37±0.25 


1.36±0.15 


1.23±0.13 


1.41±0.15 


1.33±0.14 


0.45±0.07 


0.43±0.07 


0.15-0.20 


2.13±0.23 


3.21±0.34 


2.26±0.24 


2.23±0.23 


1.88±0.20 


1.79±0.20 


1.00±0.13 


0.85±0.12 


0.20-0.50 


4.85±0.49 


4.91±0.50 


4.13±0.42 


2.95±0.30 


3.21±0.33 


3.12±0.32 


1.29±0.15 


1.48±0.17 


0.50-1.00 


11.42±1.15 


11.49±1.16 


8.30±0.85 


6.41±0.66 


6.06±0.65 


3.45±0.41 


4.66±0.51 


4.17±0.47 


1.00-3.00 


24.35±2.44 


19.39±1.97 


17.80±1.82 


9.40±1.00 


15.93±1.68 


5.68±0.68 


lO.Olil.ll 


6.28±0.72 



TABLE 3 

Average 160 /im Flux Densities [mJy] and Errors 



524 








Redshift Range 








(mJy) 


0-0.4 


0.4-0.6 


0.6-0.8 


0.8-1.0 


1.0-1.2 


1.2-1.6 


1.6-2 


2-3 


0.06-0.08 


6.19±0.68 


2.99±0.43 


3.03±0.37 


2.42±0.33 


1.93±0.29 


4.94±0.53 


<2.29±0.32 


<1.87±0.30 


0.08-0.10 


5.09±0.61 


5.01±0.61 


4.02±0.50 


4.60±0.53 


4.23±0.50 


4.15±0.49 


1.88±0.36 


<2.79±0.40 


0.10-0.15 


7.70±0.83 


7.25±0.78 


8.48±0.88 


7.00±0.73 


5.10±0.55 


6.96±0.73 


4.44±0.48 


4.90±0.56 


0.15-0.20 


8.11±0.88 


8.34±0.93 


8.92±0.95 


6.28±0.69 


9.92±1.05 


10.22±1.07 


6.20±0.69 


6.10±0.72 


0.20-0.50 


15.07±1.52 


15.62±1.60 


15.16±1.54 


12.74±1.30 


11.47±1.18 


17.50±1.79 


6.95±0.73 


9.92±1.05 


0.50-1.00 


28.72±2.91 


27.14±2.78 


30.95±3.17 


24.62±2.55 


19.85±2.19 


17.57±2.03 


18.88±2.09 


17.19±1.89 


1.00-3.00 


51.58±5.21 


51.28±5.30 


39.69±4.13 


26.11±3.01 


32.53±3.97 


<16.16±2.07 


18.94±2.48 


21.76±2.71 



Average Lir 



TABLE 4 
L(8 - 1000/^m) 



IN [log L(. 











Redshift Range 








S24 


0-0.4 


0.4-0.6 


0.6-0.8 


0.8-1.0 


1.0-1.2 


1.2-1.6 


1.6-2 


2-3 


(mJy) 


(0.3) 


(0.5) 


(0.7) 


(0.9) 


(1.1) 


(1.4) 


(1.8) 


(2.3) 


0.06-0.08 


9.96±0.07 


10.41±0.09 


10.74±0.10 


10.90±0.09 


11.12±0.09 


11.66±0.09 


<11.60±0.10 


<11.84±0.12 


0.08-0.10 


9.97±0.07 


10.66±0.08 


10.89±0.07 


11.10±0.12 


11.39±0.09 


11.69±0.11 


11.62±0.12 


<12.01±0.14 


0.10-0.15 


10.16±0.06 


10.82±0.07 


ll.lliO.lO 


11.27±0.12 


11.47±0.10 


11.84±0.09 


11.88±0.13 


12.22±0.18 


0.15-0.20 


10.18±0.07 


10.89±0.08 


11.19±0.08 


11.35±0.08 


11.69±0.10 


12.00±0.08 


12.06±0.11 


12.36±0.20 


0.20-0.50 


10.49±0.07 


11.16±0.08 


11.46±0.10 


11.61±0.10 


11.82±0.11 


12.24±0.17 


12.12±0.20 


12.55±0.22 


0.50-1.00 


10.81±0.07 


11.44±0.06 


11.77±0.10 


11.92±0.10 


12.07±0.11 


12.33±0.19 


12.55±0.15 


12.83±0.20 


1.00-3.00 


11.10±0.07 


11.69±0.07 


11.95±0.07 


11.98±0.09 


12.34±0.08 


12.23±0.11 


12.68±0.20 


13.04±0.23 



Note. — Average redshifts and ^24 for each bin arc given in parentheses. Bins in which both 70 fj,m and 160 fim stacks 
resulted in non-detections are marked as upper hrnits. 
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(a) Clean Detection in bin 
0.20 < S24/mJy < 0.50 



0.8 and (b) Bad Detection in bin 
0.06 < S24/mJy < 0.08 



vith 1.0 < 2 < 1.2 and 



Fig. 1. — Examples of a clean detection at 70 fim and a bad detection at 70 fim. The bad detection has a source in the center that does 
not resemble a clean point source, yet aperture photometry of this source yields a signal-to-noise of ~ 14. The circles in the lower left hand 
corners are the size of the aperture used to measure our stacked fluxes. 
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Fig. 2. — Stacked S70/S24 (top) and Si6o/'524 (bottom) flux ratios plotted as a function of S'24. Each color corresponds to a specific 
redshift bin, with bluer colors for low redshift and redder colors for high redshift bins. "Non-detections" are marked as upper limits, and 
la error bars are plotted for the rest of the bins. 
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Fig. 3. — Fraction of sources in each bin that arc also detected in the X-ray by XMM- Newton. At low 524 bins, X-ray sources account 
for a very small fraction of our sources, but at high flux and high redshift bins the X-ray sources begin to account for a large fraction of 
sources. 
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Fig. 4. — Stacked STo/S2i and S'i6o/S'24 flux ratios compared to models of "normal" star-forming galaxies from DALE (black lines; note: 
other empirical models cover a similar range) and models of Arp 220 (dark green line) and Mrk 231 (pink line) from SWIRE. The DALE 
models are a one parameter family of models, and we use the full range of DALE models, spanning 1 < o < 2.5. The model with the lowest 
a, and also the lowest Lir, is designated by the dot-dashed line. Each colored dot represents a different ^24 bin, with dimmer sources at 
the blue end of the spectrum and brighter sources represented by redder colors. We see that our stacked colors are consistent with those 
of the DALE galaxies and Mrk 231, but not with Arp 220. 
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Fig. 5a. — SED fits to stacked fluxes in eacii ^24 bin at a redsliift < z < 0.4. The region spanned by the best-fit SEDs from each 
Hbrary is shaded. Non- detections are marked as u pper Umits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from lDale fc Heloul (solid line) . [Chary fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). Figures 5b— 5h are available in the online version of the Journal. 
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Fig. 5b. — SED fits to stacked fluxes in eacti 524 bin at a redsliift 0.4 < z < 0.6. Tlie region spanned by tlie best-fit SEDs from each 
library is shaded. Non-detections are marked as u pper limits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid line) . [Chary fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5c. — SED fits to stacked fluxes in each S24, bin at a redshift 0.6 < z < 0.8. The region spanned by the best-fit SEDs from each 
hbrary is shaded. Non-detections are marked as u pper Umits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from jale fc Heloul (solid linel. lCharv fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5d. — SED fits to stacked fluxes in eacii 524 bin at a redshift 0.8 < z < 1.0. The region spanned by tiie best-fit SEDs from each 
library is shaded. Non-detections are marked as u pper limits, and the errors on the rest of tlie points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid line) . [Chary fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5e. — SED fits to stacked fluxes in each S24 bin at a redsiiift 1.0 < 2: < 1.2. The region spanned by the best-fit SEDs from each 
Hbrary is shaded. Non-detections are marked as u pper limits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid linel. lCharv &: Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5f. — SED fits to stacked fiuxes in each 524 bin at a redshift 1.2 < z < 1.6. The region spanned by the best-fit SEDs from each 
library is shaded. Non-detections are marked as u pper Umits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid line) . [Chary fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5g. — SED fits to stacked fluxes in each 524 bin at a redshift 1.6 < z < 2.0. The region spanned by the best-fit SEDs from each 
library is shaded. Non-detections are marked as u pper limits, and the errors on the rest of tlie points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid line) . [Chary fc Elba^ (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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Fig. 5h. — SED fits to stacked fluxes in eacii 524 bin at a redshift 2.0 < z < 3.0. Tlie region spanned by the best-fit SEDs from eacli 
library is sfiaded. Non-dctcctions are marked as u pper limits, and the errors on the rest of the points are smaller than the size of the dot. 
Libraries used are from .Dale fc Heloul (solid line) . [Chary fc Elbad (dotted line) . [Lagache et al.l (dashed line), and lSiebenmorgen fc Kriigell 
(dot-dashed line). 
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